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Information on Sequence ID No. 1: 
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Length: 1260 base pairs 

Type: Nucleic acid and amino acid sequences 

derived from it 
Strand form: single strand 

15 Topology: linear 

Origin: Listeria monocytogenes strain EGD 

Serotype l/2a 

Feature: Sequence of the dapE gene, which is 

one of the key enzymes needed for 
ypj 20 synthesis of diaminopimelic acid. The 

38 amino acid sequence is highly 

!l homologous to N-succinyl-L- 

Q diaminopimelic acid desuccinylase 

Ul (dapE) from e.g. Escherichia coli, 

% 25 Bacillus subtilis, Lactobacillus 

ffi spp., Mycobacterium tuberculosis. 



Amino acid sequence: 318 amino acids 
Nucleotide sequence: 1260 nucleotides 
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1 TGCCTTTATA GAGAACGGGA AAACATAGAG TGGAATTCAT AGAAAGAGGG 

51 CGTGAAATAT GGACCAACAA AAAAAGATTC AAATTTTAAA GGACTTGGTA 

101 AATATTGATT CGACTAATGG GCATGAAGAA CAAGTTGCGA ACTATTTGCA 
151 AAAGTTGTTA GCTGAACATG GTATTGAGTC CGAAAAGGTA CAATACGACC 
35 201 TAGACAGAGC TAGCCTAGTA AGCGAAATTG GTTCCAGTAA CGA GAA GGT T 

REG 

251 TG GCA TTT TCA GGG CAT ATG GAT GTA GTT GAT GCG GGT GAT GTA TCT AAG 
LAFSGHMDVVDAGDVSK- 
3 01 TGG AAG TTC CCA CCT TTT GAA GCG ACA GAG CAT GAA GGG AAA CTA TAC GG 
40 WKF PPFEATEHEGKLYG- 

3 51 A CGC GGC GCA ACG GAT ATG AAG TCA GGT CTA GCG GCG ATG GTT ATT GCA A 



3 - 



5 RGATDMKSGLAAMVIA- 
401 TG ATT GAA CTT CAT GAA GAA AAA CAA AAA CTA AAC GGC AAG ATC AGA TTA 

MI ELHEEKQKLNGK I R L - 
451 TTA GCA ACA GTT GGG GAA GAG ATC GGT GAA CTT GGA GCA GAA CAA CTA AC 
LATVGEEIGELGAEQLT- 
10 501 A CAA AAA GGT TAC GCA GAT GAT TTA CAT GGT TTA ATC ATC GGC GAA CCG A 
QKGYADDLHGLI I G E P - 
551 GT GGA CAC AGA ATC GTT TAT GCG CAT AAA GGT TCC ATT AAT TAT CCC GTT 

S GHR IVYAHKGS I N Y P V - 
601 AAA TCC ACT GGT AAA AAT GCC CAT AGT TCG ATG CCG GAA TCT GGT GTG AA 
15 KS TGKNAHS SMPE S GVN- 

651 T GCG ATT GAT AAC TTG CTG CTA TTT TAT AAT GAA GTA GAA AAA TTC GTG A 
AIDNLLLFYNEVEKFV- 
701 AA TCA GTT GAT GCT ACT AAC GAA ATA TTA GGC GAT TTT ATT CAT AAT GTC 
KSVDATNEILGDFIHNV 
20 751 ACC GTA ATT GAT GGT GGA AAT CAA GTC AAT AGT ATC CCT GAA AAA GCA CA 
T V I DGGNQVNS I P E K A Q - 
801 A CTG CAA GGG AAT ATT CGC TCG ATT CCA GAA ATG GAT AAT GAA ACA GTG A 
LQGNI RSI PEMDNETV- 
851 AA CAA GTG CTA GTG AAG ATT ATC AAT AAG TTA AAC AAA CAG GAA AAT GTG 
25 KQV LVK I I NKLNKQ ENV- 

901 AAT CTG GAA TTA ATA TTT GAT TAT GAT AAA CAA CCA GTA TTT AGT GAT AA 

N L EL I FDYDKQPVF S D K - 
951 A AAT TCG GAT TTA GTC CAC ATT GCT AAG AGC GTA GCA AGC GAC ATT GTC 
NSDLVHIAKSVASDIV 
30 1001 AAA GAA GAA ATC CCA TTA CTC GGT ATT TCC GGA ACA ACC GAT GCA GCA GA 
KEEIPLLGISGTTDAAE- 
1051 A TTT ACC AAA GCT AAG AAA GAG TTC CCA GTG ATT ATT TTT GGA CCA GGA A 
F TKAKKE F PV I I F G G G - 
1101 AC GAA ACC CCT CAC CAA GTA AAC GAA AAT GTT TCT ATA GGA AAT TAT TTG 
35 NETPHQVNENVSIGNYL- 
1151 GAG ATG GTA GAT GTT TAC AAA CGG ATT GCC ACC GAG TTT TTA TCT TGA TGA 

EMVDVYK RIATEFLS STOP 
12 01 AACTTTAACT TTACTTATTT CCCGATATAA AATAAGTAAT TAATAGAAGT 
1251 CTAGTATTTG 12 60 
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5 Information on Sequence ID No. 2: 



Length: 
Type: 

10 Strand form: 
Topology: 
Origin: 
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Feature : 



1337 base pairs 

Nucleic acid and amino acid sequences 
derived from it 
single strand 
linear 

Listeria monocytogenes strain EGD 
l/2a 

Sequence of the "cold shock protein" 
cspl; this protein is essential for 
the viability of Listeria at low 
temperatures . 
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Amino acid sequence: 66 amino acids 
20 Nucleotide sequence: 1337 nucleotides 
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30 



35 
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1 


GAGGCAAGTG 


GACTAATCAT 


AAAGTTTTTG 


GCGATGCAAC 


TGCGTTTTG 


51 


GCAGGAGATG 


CTTTACTAAC 


GCTCGCTTTT 


TCTATTTTAG 


CTGAAGACGA 


101 


TAATTTATCT 


TTTGAGACAC 


GCATTGCTTT 


GATTAACCAA 


ATTAGTTTTA 


151 


GTAGCGGTGC 


AGAAGGAATG 


GTTGGTGGTC 


AACTTGCAGA 


CTTGGAAGCG 


201 


GAAAACAAAC 


AAGTGACGCT 


AGAAGAGTTA 


TCATCCATTC 


ATGCACGAAA 


251 


AACGGGTGAA 


TTATTAATTT 


ATGCTGTAAC 


CTCTGCAGCA 


AAAATTGCGG 


301 


AAGCTGATCC 


AGAACAAACG 


AAACGCTTAC 


GAATTTTTGC 


AGAGAATATT 


351 


GGGATTGGAT 


TTCAAATTAG 


CGACGATATT 


TTAGATGTAA 


TTGGTGATGA 


401 


AACGAAAATG 


GGTAAAAAGA 


CAGGGGCCGA 


CGCTTTTCTG 


AATAAAAGTA 


451 


CCTATCCCGG 


ATTACTCACG 


CTTGATGGGG 


CAAAAAGGGC 


ATTAAATGAG 


501 


CATGTTACGA 


TTGCAAAGTC 


AGCGCTTTGA 


GGGCATGATT 


TCGATGATGA 


551 


AATTCTCTTG 


AAACTTGCTG 


ATTTAATCGC 


ACTTAGAGAA 


AATTAATCAT 


601 


AATTATCTAG 


TAATTTCAAA 


ATTTTTTCAC 


ATATATAATT 


CAAATTGATT 


651 


TGCTTTTCCT 


AAAATACCGT 


GTTATACTAA 


TGTAAGATTA 


TTTTTGTGGG 


701 


TGAAAGATAC 


GATTGTGAAC 


AACTTTCCAT 


CTCGTGCCGT 


TAAGCAAGAA 


751 


TAGTAAATAA 


TTAGTGTGCA 


TAACACACGA 


GGAGGAACAT 


GAAC ATG GAA 



M E 

801 CAA GGT ACA GTA AAA TGG TTT AAC GCA GAA AAA GGA TTT GGT TTT ATC GA 
QGTVKWFNAEKGFGF IE 
851 A CGC GAA AAC GGT GAC GAT GTA TTC GTA CAT TTC AGC GCT ATC CAA GGC G 
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5 RENGDDVFVHFSAIQG 
901 AC GGA TTC AAA TCT TTA GAC GAA GGT CAA GCA GTA ACT TTC GAC GTT GAA 
DGFKSLDEGQAVTFDVE 
951 GAA GGC CAA CGC GGA CCT CAA GCA GCT AAC GTT CAA AAA GCG TAA TTC TA 
EGQRGPQAANVQKA STOP 
10 1001 TTTTTTGAAT AAGAAAAAGC AAATCATTTC GGTGATTTGC TTTTTTATTT 
1051 GTCTAAAATT ATTTTACCTT GTTTGGTTTA ATGGCGATTG TTTGCTATAA 
1101 TAAGAACAAT TAATCGAGAA AAAAGACCTT GCACGCATTC ATGCGAGTGG 
1151 CTCTTTGGAA AGTGAGTTGT TTTTATTTGG ATCTTTTAAA GATAAAGGAT 
1201 CCTTCCTTTA TGAAGCGATT GGATATACAA GAATTAGAAG CACTTGCAGC 
15 1251 GGATATTCGC GCTTTTTTAA TTACTTCTAC ATCTAAATCA GGTGGGCATA 
1301 TTGGTCCGAA TCTTGGTGTG GTAGAACTAA CGATTGC 




